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invention is secreted into the growth medium of recombinant host cells. 
Alternatively, the protein or polypeptide of the present invention is produced but not 
secreted into growth medium. In such cases, to isolate the protein, the host cell (e.g., 
E. coli) carrying a recombinant plasmid is propagated, lysed by sonication, heat, or 
5 chemical treatment, and the homogenate is centrifuged to remove bacterial debris. 
The supernatant is then subjected to sequential ammonium sulfate precipitation. The 
fraction containing the polypeptide or protein of the present invention is subjected to 
gel filtration in an appropriately sized dextran or polyacrylamide column to separate 
the proteins. If necessary, the protein fraction may be further purified by HPLC. 

1 0 The DNA molecule encoding the ligase of the present invention can be 

incorporated in cells using conventional recombinant DNA technology. Generally, 
this involves inserting the DNA molecule into an expression system to which the 
DNA molecule is heterologous (i.e. not normally present). The heterologous DNA 
molecule is inserted into the expression system or vector in proper sense orientation 

1 5 and correct reading frame. The vector contains the necessary elements for the 
transcription and translation of the inserted protein-coding sequences. 

U.S. Patent No. 4,237,224 to Cohen and Boyer, which is hereby 
incorporated by reference, describes the production of expression systems in the form 
of recombinant plasmids using restriction enzyme cleavage and ligation with DNA 

20 ligase. These recombinant plasmids are then introduced by means of transformation 
and replicated in unicellular cultures including procaryotic organisms and eucaryotic 
cells grown in tissue culture. 

Recombinant genes may also be introduced into viruses, such as 
vaccina virus. Recombinant viruses can be generated by transfection of plasmids into 

25 cells infected with virus. 

Suitable vectors include, but are not limited to, the following viral 
vectors such as lambda vector system gtl 1, gt WES.tB, Charon 4, and plasmid vectors 
such as pBR322, pBR325, pACYC177, pACYC1084, pUC8, pUC9, pUC18, pUC19, 
pLG339, pR290, pKC37, pKClOl, SV 40, pBluescript II SK +/- or KS +/- (see 

30 "Stratagene Cloning Systems" Catalog (1 993) from Stratagene, La Jolla, Calif, which 
is hereby incorporated by reference), pQE, pIH821 , pGEX, pET series (see F.W. 
Studier et. a!., "Use of T7 RNA Polymerase to Direct Expression of Cloned Genes," 
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Gene Expression Technology vol. 185 (1990), which is hereby incorporated by 
reference), and any derivatives thereof. Recombinant molecules can be introduced 
into cells via transformation, particularly transduction, conjugation, mobilization, or 
electroporation. The DNA sequences are cloned into the vector using standard 
5 cloning procedures in the art, as described by Sambrook et al, Molecular Cloning: A 
Laboratory Manual . Cold Springs Laboratory, Cold Springs Harbor, New York 
(1989), which is hereby incorporated by reference. 

A variety of host- vector systems may be utilized to express the protein- 
encoding sequence(s). Primarily, the vector system must be compatible with the host 

1 0 cell used. Host- vector systems include but are not limited to the following: bacteria 
transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA; 
microorganisms such as yeast containing yeast vectors; mammalian cell systems 
infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected 
with virus (e.g., baculovirus); and plant cells infected by bacteria. The expression 

1 5 elements of these vectors vary in their strength and specificities. Depending upon the 
host-vector system utilized, any one of a number of suitable transcription and 
translation elements can be used. 

Different genetic signals and processing events control many levels of 
gene expression (e.g., DNA transcription and messenger RNA (mRNA) translation). 

20 Transcription of DNA is dependent upon the presence of a promotor 

which is a DNA sequence that directs the binding of RNA polymerase and thereby 
promotes mRNA synthesis. The DNA sequences of eucaryotic promotors differ from 
those of procaryotic promotors. Furthermore, eucaryotic promotors and 
accompanying genetic signals may not be recognized in or may not function in a 

25 procaryotic system, and, further, procaryotic promotors are not recognized and do not 
function in eucaryotic cells. 

Similarly, translation of mRNA in procaryotes depends upon the 
presence of the proper procaryotic signals which differ from those of eucaryotes. 
Efficient translation of mRNA in procaryotes requires a ribosome binding site called 

30 the Shine-Dalgarno ("SD") sequence on the mRNA. This sequence is a short 

nucleotide sequence of mRNA that is located before the start codon, usually AUG, 
which encodes the amino-terminal methionine of the protein. The SD sequences are 
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complementary to the 3'-end of the 16S rRNA (ribosomal RNA) and probably 
promote binding of mRNA to ribosomes by duplexing with the rRNA to allow correct 
positioning of the ribosome. For a review on maximizing gene expression, see 
Roberts and Lauer, Methods in Enzymology . 68:473 (1979), which is hereby 
5 incorporated by reference. 

Promotors vary in their "strength" (i.e. their ability to promote 
transcription). For the purposes of expressing a cloned gene, it is desirable to use 
strong promotors in order to obtain a high level of transcription and, hence, 
expression of the gene. Depending upon the host cell system utilized, any one of a 

1 0 number of suitable promotors may be used. For instance, when cloning in E. coli, its 
bacteriophages, or plasmids, promotors such as the T7 phage promoter, lac promotor, 
trp promotor, recA promotor, ribosomal RNA promotor, the Pr and Pl promotors of 
coliphage lambda and others, including but not limited, to lacUV5, ompF, bla, Ipp, 
and the like, may be used to direct high levels of transcription of adjacent DNA 

1 5 segments. Additionally, a hybrid trp-lac\JV5 (tac) promotor or other E. coli 

promotors produced by recombinant DNA or other synthetic DNA techniques may be 
used to provide for transcription of the inserted gene. 

Bacterial host cell strains and expression vectors may be chosen which 
inhibit the action of the promotor unless specifically induced. In certain operations, 

20 the addition of specific inducers is necessary for efficient transcription of the inserted 
DNA. For example, the lac operon is induced by the addition of lactose or IPTG 
(isopropylthio-beta-D-galactoside). A variety of other operons, such as trp, pro, etc., 
are under different controls. 

Specific initiation signals are also required for efficient gene 

25 transcription and translation in procaryotic cells. These transcription and translation 
initiation signals may vary in "strength" as measured by the quantity of gene specific 
messenger RNA and protein synthesized, respectively. The DNA expression vector, 
which contains a promotor, may also contain any combination of various "strong" 
transcription and/or translation initiation signals. For instance, efficient translation in 

30 E. coli requires an SD sequence about 7-9 bases 5' to the initiation codon ("ATG") to 
provide a ribosome binding site. Thus, any SD-ATG combination that can be utilized 
by host cell ribosomes may be employed. Such combinations include but are not 



